
1 <110> APPLICANT: Korea Research Institute of Bioscience and Biotechnology 

2 Dong Kook Pharmaceutical Co. 

3 Rhee, Sangki 

4 Choi, Euisung

5 Kang, Hyunah

6 Sohn, Junghoon 

7 Bae, Junghoon 

8 Kim, Moowoong 

9 Agaphonov, Michael

10 <120> TITLE OF INVENTION: Hansenula polymorpha mutants and process for the preparation 

11 of recombinant proteins using the same

12 <130> FILE REFERENCE: 4220-116 US 

13 <140> CURRENT APPLICATION NUMBER: US/09/993,192 

14 <141> CURRENT FILING DATE: 2001-11-14 

15 <150> PRIOR APPLICATION NUMBER: US/09/674,617A

16 <151> PRIOR FILING DATE: 2001-01-03 

17 <150> PRIOR APPLICATION NUMBER: PCT/KR00/00173 

18 <151> PRIOR FILING DATE: 2000-03-04 

19 <160> NUMBER OF SEQ ID NOS: 9

20 <170> SOFTWARE: PatentIn version 3.0

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 2218 

24 <212> TYPE: DNA

25 <213> ORGANISM: Hansenula polymorpha DL1 

26 <300> PUBLICATION INFORMATION: 

27 <308> DATABASE ACCESSION NO: U67174 

28 <309> DATABASE ENTRY DATE: 1996-08-17 

29 <313> RELEVANT RESIDUES: (1)..(2218) 

30 <400> SEQUENCE: 1 



31 ctcgagtatg attactttgt cttaattttt atttggtgta attttttact accattttaa 60 

32 tgacgtggtg tacgggtgtg aatacataaa cacctcagta gccatagcag ctaccatgga 120 

33 tgatagggca cttttggaaa aggaatctga aaacaagaag aagcacggca tacggtggtc 180 

34 aacagctcaa gaaaggccaa tgcgataaat ctcattttta tacgaacacc caccgaagcc 240 

35 cttatcggtc ttttgacgca gcagcttaat tatctgaggc tggttatcaa tttttgcctt 300 

36 ccagataaaa tatattttcc ctatttatat tccatccgtt tatcctaggc aatcgtccaa 360

37 aaaaacaaga attctgttat atttcaattc ccattatgaa gctctcgatg tctattctgg 420 

38 ccctagtggc cactagcatt gggtcggcgc aagccgctgc catcaaaaaa gataccgcgg 480 

39 ggcaacaccc cctcggaatg aactccaact ttggcgatgt tttcgtggag aaaggcaagt 540 

40 ctttgctcga ccgtgtttct gaggttgtga gcgagtcgtc caagaatatt tccccagaat 600 

41 tgagggacat ttggaatgaa atggaggcca agttcccaga taccctcaga aacatgaagc 660 

42 tcaagtcaga gccggcagtc aaaattacaa agaaacctgc cgatttttgg gacttcaatg 720 

43 ttctcaatga gaagttctcc aactacaagc tgagggttaa gaagaccgac ccgggagcat 780 

44 tgggactgga ccacacaaga cagtactctg gatacttgga tgtggaggac gaagacaagc 840 
45 atttcttcta ttggatgttt gagtccagaa atgacccggt caacgaccct gtgattctgt 900

46 ggctcaacgg tggtccagga tgctcttcct tgactggaat gctttttgag ctcggctctg 960 

47 cttctatcgg tccagatctc aagccaatca acaacccata ttcgtggaat tccaatgcca 1020 

48 ctgtgatttt ccttgaccag cctgtcaatg ttggattctc gtactcttcc aagtctgttt 1080 

49 ctaacacggt cgcagctggt aaagacgtct atgctttctt ggagttgttc taccagcaat 1140 

50 tcccacactt gctgaagaac gacttccaca tcgccggcga gtcgtacggt ggtcattaca 1200 

51 tcccagtgtt tgcctccgag attctcaccc atgctgacag atctttcaac ctcacttcgg 1260 

52 tgttgattgg taacggtttg accgacccac ttaaccagta cccattctac gagagaatgg 1320

53 catgctctac tgatggtggc tatgagcaac cctggacgag tctgagtgcg aaggaatgtt 1380 

54 tggagacctt gcctagatgt ttgtcattga ttgaatcatg ctacagctcg cagtctgtgt 1440 

55 tctcatgtgt cccggcctcc atctactgca acaacgcaca acttggacca ttccaaaaga 1500 

56 ccggcagaaa cgtctacgac gttagaaaga tgtgcgaggg aactctgtgc tacaaagaca 1560 

57 tggaatacat tgaccaatat ttgaaccagg actttgtcaa ggaaaaggtt ggcgctgagg 1620 

58 ttgacactta cgagtcgtgt aatttcgacg tgaacagaaa cttcctgttt gctggtgatt 1680 

59 ggatgaaacc ttaccacaag aacgttatca atctgctgga gcaaggtctt cctgtcctga 1740 

60 tttacgcagg agacaaggat ttcatctgca attggctcgg aaaccaagcc tggtccaatg 1800 

61 agctcccttg gtctggacac gatgaattcg agtcccccga gctgtacaac ctcaccttga 1860 

62 aggatggcac taaggtcggc gaggtcaaga atgctggcaa gttcaccttt gctagaatgt 1920 

63 ttgatggagg acacatggtt ccatacgacc agcctgagag ctctttggct atggtcaata 1980 

64 gatggatagc tggtgactac tccttgggaa ccaagaaata aaaagctcat gaaaacatta 2040 

65 tgtcattgtt gttcaactta tgtatgattg tttgttaaga tacactatct aaattcctcg 2100 

66 ttcaataccc agcctttccg ttccttgtcc catttacata ttgatttgag catcgatcgc 2160 

67 acaaccttga ctgttttttc gtttgagacg tcaatcttca gctcgtcaag aactgcag 2218 



69 <210> SEQ ID NO: 2 

70 <211> LENGTH: 2107 

71 <212> TYPE: DNA 

72 <213> ORGANISM: Hansenula polymorpha DL1 

73 <300> PUBLICATION INFORMATION: 

74 <308> DATABASE ACCESSION NO: U67173 

75 <309> DATABASE ENTRY DATE: 1996-08-17 

76 <313> RELEVANT RESIDUES: (1)..(2107) 

77 <400> SEQUENCE: 2 

78 gagctcagct accttttggg agccatggca cggtggcctt gattgtacgt tgctcctgca 60

79 atggtacaaa tgggattaca acaaaggtac cgtcaaatct cccgaaatag aagcacccgc 120

80 gatgagaaga gaaagtatat ctgcggagaa ttacatcaaa ctgaaaaagt caaagtcgcc 180

81 tgccacaggg tcagcattca ttaaaatccc aatgtccccg catcacgttc gcaagctgag 240

82 tgaaaacacg ccactgtcgc ctatcgactt tcttctggac gactatatgg cgaaccacaa 300

83 ctttacaagg gcaaaaaaca gttcgatgag gttttgaagc tgtcaaacta cggatcgata 360

84 tcttaatcag atcacttgcc tattctggaa gccgcaccag tactagttgg gatagcgtaa 420

85 atgctcaaaa ctgatgtgtt tatgtgtgac gcactaccta aaatatgagg gtggcaaaga 480

86 gagtaaaatt tggggttcat taagtaggtt ccataattta caaacctgtt ctgaatagag 540

87 agtcaccaat ccagttgccc caaaaccttg tcaaaatcat tagcatcgtg agcatctatt 600

88 ataaacaaag cgaatctcaa agatgaaact ctctttccca accctttact cgcttgctct 660

89 tgtgcttgga ttggtctccc tggccgatgc caaggtccac aaagctccca tcaaaaaagc 720

90 tcctgcacag gctacttacc aggatgttac tgtcggcgat tatgttgagt cgttgaagca 780

91 aaagtatgtt acaacttaca acaagtttat cgccgcccaa cagaatgacc agcaaattat 840

92 ctcgatcggc aagcgttctg acgagagtgc aagctcgggt cacaacactc ctttgaccaa 900

93 ctatctcaat gcccaatact tcaccgagat tcaattgggc actcctggcc aatcgttcaa 960

94 ggttatcctc gacaccggtt cctctaacct gtgggttcct agtagcgatt gcacgtcctt 1020
95 ggcctgttac ttgcatacta aataeyacca cgatgagtcg tccacttacc agaagaacgg 1080

96 ttcttcgttt gctattcaat atggctcggg ttccctcgag ggttacgttt cccaagacac 1140 

97 actgactatt ggtgaccttg tcatccctaa acaggatttc gccgaggcta ctagcgagcc 1200 

98 tggcttggct tttgcttttg gaaagtttga cggtattctg ggtctggctt acgacacgat 1260 

99 ctcggtcaac agaatcgttc ctccaattta caatgctatc aatttgggat tgctggacac 1320 

100 cccacaattc ggattttacc ttggtgacac ttcgaagtcc gagcaggatg gaggagaggc 1380 

101 tacctttggt ggatacgatg tgtctaagta cacaggcgat atcacctggt tgccagtcag 1440 

102 aagaaaggct tattgggaag tgaagtttag tggtatgccc ttggtgacga atacgctcca 1500 

103 ttggagaaca ccgggagctg ccattgatac cggaacctct ttgattgctc ttccatctca 1560 

104 attggctgag attttgaact ctcaaattgg tgccgagaag tcatggtctg gacagtacca 1620 

105 gatcgattgt gacaagagag actcactgcc tgacctcact ttcaacttcg atggttacaa 1680 

106 ctttaccatc tcgccttatg actacacttt ggaagtttct ggttcttgta tttctgcatt 1740 

107 cactccaatg gacctcccag ccccaattgg tccaatggcc atcattggtg acgctttcct 1800 

108 cagaagatac tactctgtct atgaccttgg cagagacgct gtcggattgg ctaaggctgt 1860 

109 gtaatgcaag gcagttacgt attagtttca acttgctagc atgaatttct atcgggtacg 1920 

110 gttgcttgac aacgattttc tttgtgtctc tattactgct ataaaacatg aaaagattgg 1980 

111 atttttttga cagtttctag taattgctta aatctgctct tttccttttg ttgatcccgt 2040 

112 gctgctctgt gtgcttcctt tttttatcta tattattaga ttgtctcttc tttagcagcg 2100 

113 gaagctt 2107 



115 <210> SEQ ID NO: 3 

116 <211> LENGTH: 3550 

117 <212> TYPE: DNA 

118 <213> ORGANISM: Hansenula polymorpha DL1 

119 <400> SEQUENCE: 3 



120 gaattctata ggaatggtgc aaaacgaagg atcgtgtgga ctcatgaatg tgctatattt 60 

121 gtggctaaaa agcctgccag agctcgcagg agataagagg tctggcgtcg tgctgcgcaa 120 

122 aatgatcgcc ttgataaaat tcctcaatct cgatcatgaa tcagggttca agtattttgg 180 

123 ttttttcttt caagacgagt atgaactgga acacgctctc gagtcctaga cgaccaggac 240 

124 atcaagcagg cggccgcact cgaagatctc gccgcttttg atttccaaca ctttgacgtc 300 

125 gtggaggttg accagaccct tttcgatgcc tatgagctcg agctcggtct cgaacaggtt 360 

126 tcctgggtct aacgtgaatt tcaggttgtt ggtaagagaa atcagcccca ctttgctgaa 420 

127 ctcgtgttgg aaagcccgca ccagctggtt gaggccgtaa agcgggacgg gatctgcctt 480

128 gttggcgttc tggatcggaa ttggtggcac agatttttcg tattgtttgt tgatcgatga 540 

129 ttgaatgtac agtatgagcg ataacgagtc gttcgacttg ttgataaatt gtagcttcca 600 

130 tttgaacacc tcgccgagct tgactttagt ctggcccgaa actgaaatgc tgaggttctt 660 

131 cttggagtgg ttgagagtca ccgacgagtt ggatcgtagt ttatttttct taggcaaatt 720 

132 cagagcagag ttggtcgact tggatttaaa taccccgaga ctaggcgacg cgattggggg 780 

133 cacggtgggg gccgccataa aatccacatt tgtagtccat cgtgtcacga tttttctatt 840 

134 cccgttgacc gtgcaatgga tggtgacaaa aaccggctta acccacttga tgtcgttgtt 900 

135 tgttagtctg tacgtgagcg acagaataga atggttctgc aacaccaatg gatactttac 960 

136 caccgagtag ccggcacaca gacagtttct gatgtccata tctaccgact cgattctcac 1020 

137 cgacacttct gacgcccgga cctgtccaga aagagccact tccaaagagg ccatggtttt 1080 

138 ctccgtcttg atggtcttga accgcatctg gaatacgggg ctgatccgca gcgacgtgtc 1140 

139 ggccgttacg agttcgtaaa ggtccacatg cttcgtcagc acagtgttgc ctatagtgac 1200 

140 tgtatcctgc aactctaaat tttctatctt tctcgactgt tgtcgaaacg aggctttgat 1260 

141 ctgcaattga tttgaaagtt tgtcaggcac cacggtcgtt tccagtatcc atacggcctc 1320 

142 agaccctatg acgactttgg aatcgagcac ttttctcgct gtgttttgca agttgtatac 1380 

143 caggtcgacc tcgacctgtg tcacagacga cggagtgtac accacgactt ggatgtcctc 1440 

144 gtcgaaatag gcaatgtcca gcgtgtcgca ctggcgcagc tgttccacgg agcccagcac 1500 
145 atcagcagaa acccgggttt tcggaacaag cacaccaaac attggaaaca acacttatcg 1560

146 acgcgggtgt gtttaaaaaa taatattaga tcgactgctc ctactctttt aaattcttta 1620 

147 tttatcccag gcttagacag cacagagcta accatgagtc ggtatttttt actagcgtgc 1680 

148 acattggctc tgcaatgtgt cgcggcctcc caggaggact acatagtcaa ggatttgcct 1740 
149 ggactctcga atattcctgc cgtggtgagg ccagtgatgc atgccggaca ccttgaaata 1800

150 gatgaggaac acaacaccga gctgtttttc tggcgattcc agaatccgaa gaacaacggg 1860 

151 acacaccaaa cgctccaccg caatgagctg atcgtctggc tgaacggtgg ccctggctgc 1920 

152 tcgtcgatgg atggagccat gatggaaaca ggccccctgc gggtgtcaga caagttggag 1980 

153 gtggagctca acccgggatc ctggacacag gtggccgata tcttgtttgt ggatcagccg 2040 

154 gccggaattt tgagaaagtg gagtgcgtca aaatattcca tagtattttg gctgccagta 2100 

155 gagacgaaac caagcctgcc aaagaacaat gtgtcaacat gtacgactac cgcaaacatg 2160 

156 attacttccc tgcatgcgga tccaattggc cagaaggcct gcccaccgtg actaaatttc 2220 

157 tgaatctgga tgctgtgcaa aaagccctga acctgaaatc ggcgaagaga tggcacgagt 2280 

158 gtgacggaaa agtcgaattc tttttccagc cagagcactc tgtcaagtcg ttcgacctgc 2340 

159 tgccaaaact actggagaaa atgaaaatcg cgctgtttgc tggcgacaag gacattatct 2400 

160 gcaaccacaa atcaatagaa atggtgattg aaaaattgca aattacacca ggacaattcg 2460 

161 gtttcacaaa cagcagaaag tctggctgga tctacgacgg gcaggaagtg ggtgaggttg 2520 

162 agacacagtc gaacctcacg tatatcaagg tgttcaattc gtcgcatatg gtgccttacg 2580 

163 acctgccgga ggtgagcaga ggactattcg atataatcac aaactctatt gaaaaacggt 2640 

164 cgacagacat tgtgacgccc gtttatgaca gcagaggcaa ctacaagttt gtggaggaga 2700 

165 agcaggacac cgaccagaac gaggaagaag agaaggagaa acctcccaag caccatcata 2760 

166 gtctcacgtt ttacgtggca gaggtggcga ttctggcagt gctagcatat ctgctttaca 2820 

167 gcttctacaa atcgttcgcc aaatcgcgta agtctgcatt tttgtcactc tcttccaaga 2880 

168 agaagaagaa gcaggtgcac tggtttgacg agagcgacat aggaatggac caggaagctg 2940 

169 gcgaggcaga tcataagcct aagagcatgc tggagtcggt gttcaacaag ctgggctatg 3000 

170 gaggacagta tgacacggta caggacggcc gtgacatcga gatggcgccg gtcgaagaac 3060 

171 acgaagacca atttattatt caaagcgacg aggaagagtt tggccacaga tagagtacga 3120 

172 ttatataata tccacgaaaa attagtttcc aattatttcg ctcctgattg taatgtctca 3180 

173 aagttggatc ctctggaagc ggtccctgat taccggcggt ggaattattg gaagcggtgt 3240 

174 gctactgtac aaattcacga ctcctaccga ggaacagttg atcgctaaat tatcccccga 3300 

175 gctgagagca gactatgaac gcaacaaaga attgcggagg aaagaacagg agatgttgat 3360 

176 ggagatcgtc aaacagacgg cagcgtcgaa cgatccggtg tggaagacag ggccgattgt 3420 

177 gtcgccgtgg gaccgggact ttacgccatc gagggaaagt ttattagtga agagagagcg 3480 

178 atttgagaaa gagcaggcgg agaaaaaaca gcgcgaagag ctcgagcgtc tgaaagccga 3540 

179 ggccaagctt 3550 



181 <210> SEQ ID NO: 4 

182 <211> LENGTH: 17 

183 <212> TYPE: DNA 

184 <213> ORGANISM: Artificial Sequence 

185 <220> FEATURE: 

186 <223> OTHER INFORMATION: PCR primer for S. cerevisiae PRC1 gene 

187 <400> SEQUENCE: 4 

188 atgaaagcat tcaccag 17 

190 <210> SEQ ID NO: 5 

191 <211> LENGTH: 17 

192 <212> TYPE: DNA 

193 <213> ORGANISM: Artificial Sequence 

194 <220> FEATURE: 

195 <223> OTHER INFORMATION: PCR primer for S. cerevisiae PRC1 gene 
196 <400> SEQUENCE: 5

197 ttataaggag aaaccac 17



199 <210> SEQ ID NO: 6 

200 <211> LENGTH: 17 

201 <212> TYPE: DNA 

202 <213> ORGANISM: Artificial Sequence 

203 <220> FEATURE: 

204 <223> OTHER INFORMATION: PCR primer for S. cerevisiae PEP4

205 <400> SEQUENCE: 6 

206 atgttcagct tgaaagc 17 

208 <210> SEQ ID NO: 7 

209 <211> LENGTH: 16 

210 <212> TYPE: DNA 

211 <213> ORGANISM: Artificial Sequence 

212 <220> FEATURE: 

213 <223> OTHER INFORMATION: PCR primer for S. cerevisiae 

214 <400> SEQUENCE: 7 

215 tcaaattcgt ttggcc 16 

217 <210> SEQ ID NO: 8 

218 <211> LENGTH: 27 

219 <212> TYPE: DNA 

220 <213> ORGANISM: Artificial Sequence 

221 <220> FEATURE: 

222 <223> OTHER INFORMATION: PCR primer for H. polymorpha KEX1

223 <400> SEQUENCE: 8 

224 tggytsaacg ghccwgghtg ytcbtcb 27 

226 <210> SEQ ID NO: 9 

227 <211> LENGTH: 30 

228 <212> TYPE: DNA 

229 <213> ORGANISM: Artificial Sequence 

230 <220> FEATURE: 

231 <223> OTHER INFORMATION: PCR primer for H. polymorpha KEX1

232 <400> SEQUENCE: 9 

233 wggratgtay tgwccrgcgt avgactcdcc 30 
Invalid Line Length

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:1; Line(s) 10
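The line-length rule quoted above can be checked mechanically before a listing is submitted. The following is a minimal sketch, assuming the listing is available as plain text and that spaces count toward the limit, as the report states. The function name `find_overlong_lines` is hypothetical and is not part of any official validation tool.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the report above; spaces are counted


def find_overlong_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`.

    Line numbers are 1-based. Line terminators are not counted, but
    interior and trailing spaces are.
    """
    overlong = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            overlong.append((number, len(line)))
    return overlong


if __name__ == "__main__":
    # Synthetic three-line sample: only the second line exceeds the limit.
    sample = "short line\n" + "x" * 73 + "\n" + "y" * 72
    for number, length in find_overlong_lines(sample):
        print(f"Line {number}: {length} characters (limit {MAX_LINE_LENGTH})")
```

A line of exactly 72 characters passes; only longer lines are reported, which matches the "not exceed 72 characters" wording.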